ON THE MATRIX EQUATION XA - AX = X p 



DIETRICH BURDE 

Abstract. We study the matrix equation XA — AX = X p in M n (K) for 1 < p < n. It is 
shown that every matrix solution X is nilpotent and that the generalized eigenspaces of A are 
X-invariant. For A being a full Jordan block we describe how to compute all matrix solutions. 
Combinatorial formulas for A m X l 1 X l A m and (AX) e are given. The case p = 2 is a special 
case of the algebraic Riccati equation. 



1. Introduction 
Let p be a positive integer. The matrix equation 

XA — AX = X p 



2 



arises from questions in Lie theory. In particular, the quadratic matrix equation XA—AX = X 
plays a role in the study of affine structures on solvable Lie algebras. 

An affine structure on a Lie algebra g over a field K is a iC-bilinear product g x g — > g, 
(x, y) i— > x • y such that 

x ■ (y • z) — (x ■ y) • z = y • (x ■ z) — (y • x) • z 
[x,y] = x ■ y - y ■ x 

for all x,y,z 6 g where [x, y] denotes the Lie bracket of g. Affine structures on Lie algebras 
correspond to left-invariant affine structures on Lie groups. They are important for affine 
manifolds and for affine crystallographic groups, see [I], 0. 

We want to explain how the quadratic matrix equations XA — AX = X 2 arise from affine 
structures. Let g be a two-step solvable Lie algebra. This means we have an exact sequence of 
Lie algebras 

with the following data: o and b are abelian Lie algebras, (p : b i— > End(a) is a Lie algebra 
representation, Q 6 Z 2 (b, a) is a 2-cocycle, and the Lie bracket of g = a x b is given by 

[(a, x), (6, y)} := ((f(x)b - (p(y)a + Q(x, y),0). 

Let u : b x b — > a be a bilinear map and <pi, ip? : b i— > End(a) Lie algebra representations. A 
natural choice for a left-symmetric product on g is the bilinear product given by 

(a, x) o (6, y) := (ipi(y)a + ^>2(x)b + uj(x, y),0). 

One of the necessary conditions for the product to be left-symmetric is the following: 

Vi{ x )v{y) - <p(v)<pi(x) = <pi(y)<pi(x). 
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2 D. BURDE 

Let (ei, . . . e m ) be a basis of b and write Xi := and Aj := tp(ej) for the linear operators. 

We obtain the matrix equations 

XiAj — AjXi = XjXi 

for all 1 < i,j < m. In particular we have matrix equations of the type XA — AX = X 2 . 



2. General results 

Let K be an algebraically closed field of characteristic zero. In general it is quite difficult to 
determine the matrix solutions of a nonlinear matrix equation. Even the existence of solutions 
is a serious issue as illustrated by the quadratic matrix equation 



X 2 



1 




which has no solution. On the other hand our equation XA — AX = X p always has a solution, 
for any given A, namely X = 0. However, if A has a multiple eigenvalue, then we have a lot 
of nontrivial solutions and there is no easy way to describe the solution set algebraically. A 
special set of solutions is obtained by the matrices X satisfying XA — AX = = X p . First one 
can determine the matrices X commuting with A and then pick out those satisfying X p = 0. 
Let E denote the n x n identity. We will assume most of time that p > 2 since for p = 1 we 
obtain the linear matrix equation AX + X(E — A) =0 which is a special case of the Sylvester 
matrix equation AX + XB = C. Let S : M n (K) -> M n (K) with S(X) = AX + XB be the 
Sylvester operator. It is well known that the linear operator S is singular if and only if A and 
—B have a common eigenvalue, see [3]. For B = E — A we obtain the following result. 

Proposition 2.1. The matrix equation XA — AX = X has a nonzero solution if and only if 
A and A — E have a common eigenvalue. 

The general solution of the matrix equation AX = XB is given in [2 . We have the following 
results on the solutions of our general equation. 

Proposition 2.2. Let A e M n (K). Then every matrix solution X G M n (K) of X A- AX = X p 
is nilpotent and hence satisfies X n = 0. 

Proof. We have X k (XA — AX) = X k+P for all k > 0. Taking the trace on both sides we obtain 
tr(X fc+p ) = for all k > 0. Let Ai, . . . , A r be the pairwise distinct eigenvalues of X. For s > 1 
we have 



tr(X s ) = ^m^. 



i=l 



For s > p we have tr(X s ) = and hence 

r 

£KAf)A* = 

i=i 

for all k > 0. This is a system of linear equations in the unknowns Xi = mi\ p for i — 1, 2, . . . , r. 
The determinant of its coefficients is a Vandermonde determinant. It is nonzero since the Aj 
are pairwise distinct. Hence it follows m^Af = for all i — 1, 2, . . . , r. This means Xi = A2 = 
• • • = A r = so that X is nilpotent with X n = 0. □ 
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Since for p = n our equation reduces to X n = and the linear matrix equation XA = AX, 
we may assume that p < n. 

Proposition 2.3. Let K be an algebraically closed field and p be a positive integer. IfX,A e 
M n (K) satisfy XA — AX = X p then X and A can be simultaneously triangularized. 

Proof. Let V be the vector space generated by A and all X 1 . Since X is nilpotent we can choose 
a minimal m G N such that X m = 0. Then V = span{A, X, X 2 , . . . ,X m ~ 1 }. We define a Lie 
bracket on V by taking commutators. Using induction on t we see that for all I > 1 

(1) X e A-AX e = £X p+e - 1 

Hence the Lie brackets are defined by 

[A,A]=0 

[A, X l \ = AX i - X' l A = -iX p+i - 1 
[X\X j ] = 0. 

It follows that V is a finite-dimensional Lie algebra. The commutator Lie algebra [V, V] is 
abelian and V/[V, V] is 1-dimensional. Hence V is solvable. By Lie's theorem V is triangular- 
izable. Hence there is a basis such that X and A are simultaneously upper triangular. □ 

Corollary 2.4. Let X,Ae M n (K) satisfy the matrix equation XA-AX = X p . Then A i X k A> 
is nilpotent for all k > 1 andi,j > 0. So are linear combinations of such matrices. 

Proof. We may assume that X and A are simultaneously upper triangular. Since X is nilpotent, 
X k is strictly upper triangular. The product of such a matrix with an upper triangular matrix 
A 1 or A 7 is again strictly upper triangular. Moreover a linear combination of strictly upper 
triangular matrices is again strictly upper triangular. □ 

Proposition 2.5. Let p > 2 and A e M n (K). If A has no multiple eigenvalue then X = is 
the only matrix solution of XA — AX = X p . Conversely if A has a multiple eigenvalue then 
there exists a nontrivial solution X^O. 

Proof. Assume first that A has no multiple eigenvalue. Let B = (e±, . . . , e n ) be a basis of K n 
such that A = (a^) and X = (x^) are upper triangular relative to B. In particular = for 
i > j and x^ = for i > j. Since all eigenvalues of A are distinct, A is diagonalizable. We 
can diagonalize A by a base change of the form C{ \— > fi±ei + /i 2 e2 + • • • + z^e, which also keeps 
X strictly upper triangular. Hence we may assume that A is diagonal and X is strictly upper 
triangular. Then the coefficients of the matrix XA — AX = (cjj) satisfy 

for % > j. 



Xi 



K a jj 



Xij 



n for all 
It follows that 



Consider the lowest nonzero line parallel to the main diagonal in X. Since ajj — a 
i 7^ j this line stays also nonzero in XA — AX, but not in X p because of p > 2 
X = 0. 

Now assume that A has a multiple eigenvalue. There exists a basis of K n such that A has 
canonical Jordan block form. Each Jordan block is an matrix of the form 



J(r, A) = 









Vo 



1 

A 









A 








1 

A/ 



e M r (K). 
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For A = we put J(r) = J(r, 0). It is J(r, A) = J(r) + A-E 1 . Consider the matrix equation 
XJ(r, A) - J(r, \)X = X p in M r (K). It is equivalent to the equation XJ(r) - J(r)X = Xp. If 
r > 2 it has a nonzero solution, namely the r x r matrix 

X 

Indeed, XJ(r) — J(r)X = = X p in that case. Since A has a multiple eigenvalue, it has a 
Jordan block of size r > 2. After permutation we may assume that this is the first Jordan 
block of A. Let X G M r (K) be the above matrix and extend it to an n x n-matrix by forming 
a block matrix with X and the zero matrix in M n _ r (K). This will be a nontrivial solution of 
XA- AX = X p in M n (K). □ 

Lemma 2.6. Let A,X G M n (K) and Ai = SAS" 1 , Xi = SXS" 1 /or some 5 € GL n (K). 
Then XA- AX = X p if and only if X X A X - A X X X = Xf . 

Proof. The equation X p = — AX is equivalent to 

xf = (sxs-'y = sx p s~ 1 

= (SXS-^iSAS- 1 ) - (SAS-^iSXS- 1 ) 
= XiAi - A X X X . 

□ 

The lemma says that we may choose a basis of K n such that A has canonical Jordan form. 
Denote by C(A) = {S G M n (K) | SA = AS} the centralizer of A G M n (K). Applying the 
lemma with A 1 = SAS^ 1 = A, where S G C(A) fl GL n (K), we obtain the following corollary. 

Corollary 2.7. // X is a matrix solution of XA — AX = X p then so is X = SXqS^ 1 for any 

5 G C(A) n GL n {K). 

Let i3 = (ei, . . . , e n ) be a basis of K n such that A has canonical Jordan form. Then A is a 
block matrix 

A = diag(A 1 ,A 2 ,...,A fe ) 

with Aj G M r .(K) and A leaves invariant the corresponding subspaces of K n . Let X satisfy 
XA — AX = X p . Does it follow that X is also a block matrix X = diag(X l5 . . . ,X r ) with 
Xi G M r . (K) relative to the basis B ? In general this is not the case. 

Example 2.8. The matrices 

A = I 1 I , X 

satisfy XA- AX = X 2 . 

Here A = diag(J(l), J(2)) leaves invariant the subspaces spanjei} and span{e 2 ,e 3 } corre- 
sponding to the Jordan blocks J(l) and J(2), but X does not. Also the subspace kerA is 
not X-invariant. This shows that the eigenspaces E x = {x G K n \ Ax = \x} of A need not 
be X-invariant. However, we have the following result concerning the generalized eigenspaces 
H x = {x G K n | (A - \E) k x = for some k > 0} of A. 
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Proposition 2.9. Let A,X G M n (K) satisfy XA - AX = X p for 1 < p < n. Then the 
generalized eigenspaces 7i\ of A are X -invariant, i.e., X7i\ C Ti,\. 

Proof. Let A be an eigenvalue of A and H\ be the generalized eigenspace. We may assume 
that A has canonical Jordan form such that A = diag(Ai, A 2 ) with A\ = J(r, A). We may 
also assume that A = 0. This follows by considering B = A — XE instead of A which satisfies 
XB — BX = XA — AX = X p . Let v G 7io- Then there exists an integer m > such that 
A m v = 0. Let r be an integer with r > n. We have X r = 0. By induction on k > 1 we will 
show that 

A m+k~l x r-k(p-l) v _ q for 1 < k < 

p — 1 

This implies the desired result as follows: set r = 1 + k(p — 1). We can choose > 1 such that 

r > n. Then A m + k ~ x Xv = and hence Xv G ft - 

For fc=lwe have to show A m X r ~^' p ~ v * l v = 0. By we have 

AXr-tp-i) _ x r -(p-^A = (p-l- r)X r = 0. 
Hence A and X r ~( p ~ 1 ^ commute. It follows that also A m and X 1 '^^ 1 ^ commute. Hence 

A m X r-(p-l) v = X r-(p-l) A m v = Q 

Assume now that y 4 m + fc - 2 x r ~ (fc ^ 1)(p_1) -u = 0. Then we have 

Axr _ fc(p _l) _ x r~k( P -l) A = ( k(j) _ ^ _ ^ x r-(k-l)(p-l) _ 

Now we will use the following formula: let s,£ > 1 be integers and A,X G M n (K) satisfying 
— AX = X p , where 1 < p < n. Then there exist integers bj = bj(p, £, s) such that 

s 

A s X e = J2b j X e+j( P-VA s - j . 

3=0 

This formula can be easily proved by induction. We will compute explicitly the coefficients bj 
in the last section, see formula (fTU|) . If we use the formula for £ = r — k{p — 1) and s = m + k — 2 
then we obtain 

m+k-2 

A m+k-2 X r-k(p-l) _ fo^ X r+(j-k)(p-l) A m+k-2-j 

j=0 



It follows 



A m+k-l X r-k(p-l) v _ A m+k-2 X r-k(p-l) Av 



m+k-2 

— ^.^v; r '+(i- fc )(p- 1 )^4 m + fc -i- 1 t , 

j=0 

Here all terms with j > k vanish since X r = 0. On the other hand, A m+k - j ~ l v = for all 
j < k - 1. It follows A m+fc - 1 X r - fc (P- 1 )?j = 0. □ 

Corollary 2.10. Let A = di&g(Ai, A2) be a block matrix such that A\ G M r (K) and A2 G 
M S (K) have no common eigenvalues. Then any matrix solution X G M n (K) of X A — AX = X p 
for 1 < p < n is a block matrix X = diag(X!,X 2 ) with X x G M r (K) and X 2 G M S (K) such 
that X 1 A 1 - A X X X = Xf and X 2 A 2 - A 2 X 2 = Xf . 

This says that looking at the solutions of XA — AX = X p we may restrict to the case that 
A has exactly one eigenvalue A G K. Without loss of generality we may assume that A = 0. 
We can say more on the solution set if A has some particular properties. The most convenient 
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special case is that A = J(n) is a full Jordan block. Then we can determine all matrix solutions 
of XJ(n) — J(n)X = X p . This is done in the following section. 

3. The case A = J(n) 

We have already seen that kerA in general is not X-invariant. However, it is true if kerA is 
1-dimensional. But this is the case for A = J{n). 

Lemma 3.1. Let A,Xe M n (K) satisfy XA — AX = X p for 1 < p < n and assume that A is 
an eigenvalue of A with 1-dimensional eigenspace E\ generated by v G K n . Then Xv = 0. 

Proof. We will show that X £ v = implies X^"^" 1 ^ = for all £ > p. By (JTJ we have 

(2) AX e -( p -V - X e -( p -^A = (p - 1 - £)X l . 

Using Av = Xv and X l v = we obtain (A - XE)X £ - (p -^v = so that X'-^h G E\ = 
span{t>}. Hence X e ~( p ~^v = fiv for some jx G K. But since X is nilpotent we have /i = 0. 

Now we repeat this argument starting with X n v = 0. If we arrive at X k v = and k < p 
then X p v = and in the next step Xv = 0. □ 

Proposition 3.2. Let A = J(n) and X G M n (K) be a matrix solution of XA — AX = X p for 

1 < p < n. Then X is strictly upper triangular. 

Proof. Let (ei, . . . e n ) be the canonical basis of K n . Then ker A = spanjei} and Xe± = by 
the above lemma. Now we can use induction by writing 



X = 







en 


be 



with X\ G M n _i(K). It holds X\J{n — 1) — J(n — l)Xi = Xf so that X\ is upper triangular 
by induction hypothesis. Hence X is also upper triangular. □ 

Proposition 3.3. Let p be an integer with 1 < p < n and let A = J(n), X = (xjj) G M n (K). 
Then X is a matrix solution of XA — AX = X p if and only if 

(3) Xij = for all 1 < j < i < n 

j-p+l j-p+2 j-l 

( 4 ) Zvj-l - Zi+lj = "' Y x i,h x h,i2 ' ' ' Xlp-2,tp-i x £p-i,j 

for all 1 < i < j < n. 

Proof. By proposition 13. 21 we know that X is upper triangular. Hence holds. The equations 
follow by matrix multiplication. The (i,j)-th coefficient of XA — AX is just the LHS of 
P)l whereas the (z,j)-th coefficient of X p is given by the RHS of (@J). This may be seen by 
induction. □ 

Remark 3.4. We can solve the polynomial equations given by (jlj) recursively. For p > 3 every 
Xi+ij can be expressed as a polynomial in the free variables x\^ ■ ■ ■ , £i, n since the RHS of (JH) 
does not contain Xj+ij. For p = 2 however it does contain for £ = i + 1. In that case we 

rewrite the equations as follows. 



XA — AX = X v 



£i=i+2 



For j = z + 2 we obtain 2^+1^+2(1 + Xj.j+i) = Xj.i+i- This shows that 1 + x^+i is always 
nonzero. It follows that every Xi+xj is a polynomial in xi )2 , ■ ■ ■ ,xi, n divided by a product of 
factors 1 + kxi,2 also being nonzero. The formulas can be determined recursively. The first two 
are as follows: 

Xl,2 



x i+l,i+2 



x i+l,i+3 



1 + ixip 

Xi, 3 (l +x 1<2 ) 
(l + ixi, 2 )(l + (1 + 1)3:1,2) 



Example 3.5. Let n = 5 and A = J(5). T/ien a// matrix solutions X 
XA — AX = X 2 are given by 



\ 



(x y ) e Of 



X 



(0 


x 12 


X13 


X14 


Xl5 












(l+2xi 2 ) 2 a;i4-(l+a;i2)a:i3 


1+X12 


l+2x 12 


(l+a;i2)(l+2xi 2 )(l+3a;i2) 











X12 


(1+3:12)0:13 


l+2x 12 


(l+2xi 2 )(l+3a!i2) 














X12 

l+3xi2 


1° 















/ 

Corollary 3.6. Let A = J(n). A special matrix solution of X A — AX = X 2 is given as follows: 

\ 





/o 


a 











a 
1+a 






































l+2a 










l+(n-2)a 





In some cases all matrix solutions of XJ(n) — J(n)X = X 2 are conjugated to Xq. 



Proposition 3.7. Let A = J{n) and X 



G M n (K) be a matrix solution of XA — AX 



X 2 with X\2 — 01 7^ 0. Then there exists an S G GL n (K) n C(A) such that X = SX S 1 . 

Proof. We will prove the result by induction on n. The case n = 2 is obvious. For A = J(n) we 
have C(A) = {ciA° + C2A 1 + ■ ■ • + c n A n ~ l \ Ci G K}. The matrix solution X is strictly upper 
triangular by proposition 13.21 We have 



X 



( x ' 




\0---Q 
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with X' G M n -x(K). It is easy to see that XA - AX = X 2 implies X'A' - A'X' = X' 2 with 
A' = J(n — 1). Hence by assumption there exists an S' G GL n _x(K) R C(A') such that 



s'x's'- 1 = X' 



where X' Q is the special solution in dimension n — 1. We can extend S' to a matrix S\ G 
GL n (K) n C(A) as follows: 



Si 



/ 


s n \ 


S" 






S 2 


\0---0 


sj 



sx s 2 s n -i 




\0 



si s 2 
■■■ s x ) 



One verifies that 





( 






SxXS^ 1 = 


S' 








* 






\0---0 










ri \ 














r n ~i 




\0---0 








/ 




/ 




X' 










* 




* 


\0---0 


0/ 


\o---o 


sx 1 ) 



The last matrix is not yet equal to Xq. It is however a solution of XA — AX = X 2 by corollary 
12.71 A short computation shows that this is true if and only if 

= n(l + 0' - l)a) for z = 2,3,...,n-2 

a 



Tn-l 



Hence we have r 2 
with 



1 + (n- 2)a 

r n „2 = 0. It remains to achieve ri = 0. This is done by conjugating 



S- 2 



(\ 
1 




\0 



r 0\ 

r 



1 
1/ 



where r = ri ^ 1+ 4 ^ 2 2 ^ . Note that we have by assumption Let S := S 2 Sx G GL n (K) fl 

C(A). We obtain SXS" 1 = X . □ 

The solutions X with xi 2 = need not be conjugated to X . 

Example 3.8. Let A = J(8) and X = a A 4 + (3 A 5 + 7A 6 + 5A 7 . Then XA - AX = = X 2 
and SXS -1 = X for all S G GL n (K) D C(A). In particular X is not conjugated to X . 

In general we may assume that A is a Jordan block matrix with Jordan blocks J{rx), • • • , J{t}~)- 
If we have found solutions Xx, . . . ,X k to the equations X^ J{r\) — J(rAXi = Xf for i — 1, 2, . . . , k 
then X = diag(Xi, . . . , Xf.) is a solution of XA — AX = X p . However these are not the only 
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solutions in general. How can one determine the other solutions ? One way would be to classify 
the matrix solutions of XA — AX = X p up to conjugation with the centralizer of A. First 
examples show that this classification will be complicated. The following examples illustrate 
this for A = diag( J(2), J(2)) and p = 2, 3. 

Example 3.9. The matrix solutions of XA — AX = X 2 with 

/0 1 o o\ 


1 
\0 0/ 

are given, up to conjugation with S G GL±{K) n C(A), by the following matrices: 



X 1 



X. 



(0 


-l 





0) 




(o 


-1 





o\ 








1 





, x 2 = 











1 











1 
















V° 








°y 




v° 








V 


/o 


a 





°\ 






/o 


a 





A 







































p 













a 


V° 








°) 






v° 








V 



X : 



3,a 



/o 1 \ 

a 

1 -a 

\0 / 



4,a,P 



The solutions X^ a $ = diag(Y"i, Y 2 ) arise from the solutions of the equations J(2)l^— Yi J(2) = 
X 2 . The solutions satisfying X 2 = = XA — AX are given by X 3; i, -^4^^ and X 5iQ . The result 
is verified by an explicit computation. The centralizer of A consists of matrices of the form 





S3 



S2 
Si 
S 4 

S3 



S5 



S7 




Sq\ 

S5 
ss 

s 7 ; 



whose determinant is given by (S1S7 — S3S5)' 



Example 3.10. The matrix solutions of XA — AX = X 3 with A = diag(J(2), J(2)) are given, 
up to conjugation with S G GL^K) n C(A) ; by the following matrices: 



X. 



3,a,/3 



/0 a 0\ 
/5 
0^00 

\0 0/ 



/o « o\ 



/5 

\0 0/ 



X- 



X. 



4,a 





/o 


a 


1 


0\ 













1 


















v° 








V 


/o 


Q! 





1\ 



























a 




v° 













The only solutions satisfying X 3 7^ are Xi jQ , j( g where a ^ [3. 
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4. The case p = 2 

For p = 2 our matrix equation is given by X 2 = XA — AX. This equation is a special case of 
the well known algebraic Riccati equation. There is a large literature on this equation, see [3] 
and the references therein. In particular, there is a well known result on the parametrization 
of solutions of the Riccati equation using Jordan chains. The consequence is that matrix 
solutions can be constructed by determining Jordan chains of certain matrices. This does not 
mean, however, that we are able to solve the algebraic Riccati equation explicitly. The problem 
is only reformulated in terms of Jordan chains. Nevertheless this is an interesting approach. 
We will apply this result to our special case and demonstrate it by an example. 
The algebraic Riccati equation is the following quadratic matrix equation |3j 

XBX + XA- DX -C = 

where A, B, C, D have sizes n x n,n x m,m x n and m x m respectively. Here m x n matrix 
solutions X are to be found. The special case m = n and B = —E, D = A, C = yields 
XA - AX - X 2 = 0. 

Definition 4.1. A Jordan chain of an n x n matrix T is an ordered set of vectors x\, . . . x r G K n 
such that x\ ^ and for some eigenvalue A of T the equalities 

(T - \E)x l = 

(T — \E)x2 = x\ 



(T - \E)x r 



x 



r-l 



hold. 



The vectors called generalized eigenvectors of T associated with the eigenvalue 

A and the eigenvector x\. The number r is called the length of the Jordan chain. 
We call the n-dimensional subspace 

~ E~ 



G(X) 



1m 



X 



c K 



•in 



the graph of X. Denote by T G M 2n (K) the matrix 

\A 







-E 

A 



Then we have the following simple result 0: 



Proposition 4.2. For any n x n matrix X , the graph of X is T -invariant if and only if X is 
a solution of XA - AX = X 2 . 

Representing the T-invariant subspace G(X) as the linear span of Jordan chains of T, we 
obtain the following result j3]. 

Proposition 4.3. The matrix X G M n (K) is a solution of X A — AX = X 2 if and only if there 
is a set of vectors v\, . . . , v n G K 2n consisting of sets of Jordan chains for T such that 



XA — AX = X p 11 
where y^z^ G K n and . . . ,y n ) forms a basis of K n . Furthermore, if 

Y=[yi V2 ■■■ y n ]eM n {K), Z=[z± z 2 ■■■ z n ] g M n (K), 

every matrix solution of XA — AX = X 2 has the form X = ZY~ X for some set of Jordan 
chains v i, . . . , v n for T, such that Y is nonsingular. 

It follows that there is a one-to-one correspondence between the set of solutions of XA—AX = 
X 2 and a certain subset of n-dimensional T-invariant subspaces. 

Example 4.4. Let A = diag(J(2), J(2)) G M 4 (K) and 



A -E 
A 



G M 8 (K) 



Then a set of Jordan chains for T is given by 

ux = (2, 0,0, 0,0, 0,0,0)* 
« 2 = (0, 1,0, 0,-1, 0,0,0)* 
^ = (0,0,-1,0,0,-1,0,0)' 
^ = (0,0,0,1,0,0,1,0)' 

We have Tvi = 0, Tv 2 = vi, Tv 3 = v 2 and Tt> 4 = t> 3 . Then 



X = ZY- 



/0 -1 0\ /2 0\ 
-1 1 
1 -1 

\o o oo/yoooiy 

is a matrix solution of XA — AX = X 2 , see example \3.iA 



/0 -1 o o\ 

10 

1 

\0 0/ 



Note that Jordan chains of length three for T are given by vectors of the form 

vx = (2ai,0,2a 2 , 0,0, 0,0,0)' 
t> 2 = (a 3 , oi, a 4 , «2, — ai, 0, -a 2 , 0)' 

u 3 = ( a 5 ; a 6 ; a 7? a 85 a 6 — a 3 ; ~" a l5 a 8 — a 4i ~ a 2)* 



5. Combinatorial formulas 

The matrix equation XA — AX = X p may be interpreted as a commutator rule [X, A] = X p . 
Successive commuting yields very interesting formulas for X i A m and A m X e , where £, m > 1. 
For m = 1 the formulas are easy: we have X e A = AX 1 + iX^' 1 and AX £ = X^A - £X £+p -\ 
For m > 2 these formulas become more complicated. Finally we will prove a formula for (AX) e . 
Although it is not needed for the study of solutions of our matrix equation, we would like to 
include this formula here. In fact, the commutator formulas presented here are important for 
many topics in combinatorics. We are able to prove explicit formulas involving weighted Stirling 
numbers. 

Proposition 5.1. Letp > 1 and let X, A G M n (K) satisfy the matrix equation XA—AX = X p . 
Then for £, m > 1 we have 
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(5) X l A m = Y. a ^\ k ) A m - k X e+k ^ p 



-1) 



where a^(0) = 1 and a^{k) = a^ p {k) = n^=o[^ + j(P ~ !)]• particular we have 

(6) X^A = AX £ + fX^+P" 1 

(7) X^A 2 = A 2 X e + 2£AX e+p ~ l + £(£ + p- \)x l+2 ^ 

x e A 3 = A 3 X e + 3M 2 X £+ P -i + U ^ + p _ 1 } AX e+2( P -i) + i( i + p _i)(i + 2 (p _ i)) 

For p = 2 i/ie formula simplifies to 



(9) 



Proof. For the case m — 1 see (JTJ. Now (j3J) follows by induction over m. Note that a^/c + 1) = 
a £ {k){£ + k{p-l)). 

rn , v 

= S ( 7 ) Am ~ fe (^x^^- 1 ) + (£ + fc(p - i))x £+ ( fc+i )( p - 1 )) 

fc=0 ^ ' 

fc=0 ^ ' fc=0 ^ ' 

= A m+1 X e + [?)a,(k)A m+1 - k X i+k ^ 
k=i ^ ' 

+ ( , _ : ) [(€ + (A: - l)(p - l))a/(fc - 1)] ^ m+1 - fc x £+fc(p_1) 
fc=i ^ ' 

+ {£ + m{p - l))a £ (m)X^ +(m+1)(p - 1) 

For p = 2 we have a e (k) = £(£ + I) ■■■(£ + k - 1) = k\ (^V) • D 
In the same way one can prove the following result by induction: 

Proposition 5.2. Letp > 1 and let X, A e M n (K) satisfy the matrix equation X A— AX = X p . 
Then for £,m > 1 we have 



XA — AX = X p 13 



m , x 

(10) A m X e = ^(-l) fc a«(fc) ( J x^fr- 1 )^-*. 

fc=0 ^ ' 

Proposition 5.3. Let p > 2 and let X,Ae M n (K) satisfy the matrix equation XA—AX = X p . 
Then we have for all £ > 1 



(11) (AX) 1 = J2 c ( £ ' k)A e - k X e+k{p 



i) 



k=0 



where the numbers c(£, k) = c(£, k, p) are defined by the following recurrence relation for 1 < 
k<£. 



(12) c(£,0) = l 

(13) c(£,£) = 

(14) c{£ + l,k) = c(£, k) + [£ + (p— l)(k - 1)] c(£, k - 1) 
Proof. We proceed by induction on I. Using we obtain 

e-i 



{AXy +1 = {AX) 1 AX = ^c{£, k)A e - k (X e+k{p - 1] A) X 

k=0 

e-i 

= c(£, k)A e ~ k [AX i+k{p ^ + (£ + k(p - l))X l ^ k+1 ^ p -^} X 

k=0 

i-i e-i 
= £ c{£, k )A e+1 - k X e+1+k ^ + c ( £ > k )( £ + k (P ~ \))A l - k X l ^ k ^- x \ 

k=0 k=0 

Using dHJ) it follows 

i-i 

(AX) e+1 = A i+1 X i+1 + k)A t+1 - k X i+1+k ^ 

k=l 

I 

+ c ( ( i k - 1)(£ + (k - l)(p - i))A e+1 - k X £+1+kl *- l) 

k=l 

= A i+1 X e+1 + ^Tc(£+ 1, k)A e+1 - k X e+1+k ^ 

k=l 

+ c(£, £ - l){£ + {£ - l){p - l))AX e+1+e( P-V 



J2c{£+l,k)A e+1 - k X e+1+k ( p - 



-i) 

fc=0 

□ 
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The integers c(£, k, p) are uniquely determined. The following table shows the values for 

J-l 



= 1,2 


5 • - 


. 6 and k = 0, 


e\k 





1 


2 


i 


1 






2 


1 


1 




3 


1 


3 


p+ 1 


4 


1 


6 


4p + 7 


5 


1 


10 


5(2p + 5) 


6 


1 


15 


5(4p+13) 



(p+l)(2p + l) 
5(p+l)(2p + 3) 
15(p + 2)(2p + 3) 



(p+l)(2p+l)(3p+l) 
(p+l)(36j5 2 + 70j5 + 31) 



(p + l)(2p + l)(3p+l)(4j>+l) 



It is possible to find an explicit formula for the c(£, k,p). 
Proposition 5.4. For £ > 1 and < A; < £ — 1 we /iave 



£-1 



(15) 



m = (p - ^ + a - p)*-]- 



For p = 2 the formula reduces to 
(16) c(£,k,2) 



+ k- 1 

2A; 



' .7=1 



Proof. Let 5(n, fc) = 5(n, fc, A 1 6*) denote the weighted degenerated Stirling numbers for n, k > 1, 
see 0. They are given by 



(17) 
(18) 
(19) 



5(n, n) = 1 

n-1 

5(n,0) = IJ(A-^) 
5(7i + 1, k) = (k + A - 07i)5(n, fc) + 5(71, fc - 1). 



They satisfy the following explicit formula (see (4.2) of [U] 



(20) 



k (_i)k+r n ~ 1 

s(n,k^\e) = J2i T T J -^U^ + r -^- 



r! (k — r)\ 

r=0 v ' j=0 



We can rewrite the recurrence relation ()14j) for the numbers c(£, k) as follows. If we substitute 
k by £ — k + 1 then we obtain 



c{£ + 1, £ - k + 1) = c(£, £ - ife + 1) + [£ + (p - 1){£ - k)} c{£, £ - k). 

Here £ — k + 1 runs through 1, 2, ... £ if k does. Now set c(£, £ — k) = s(£, k)(l — p) e ~ k . Then 
the above recurrence relation implies that 



XA — AX = X v 
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s(£,£) = c{£,0) = 1 
s(£,0) = c{£,£)(l- P y 

s(£ + l,k) = s(£,k-l) + 



e-i 

j=0 



k + 



P 



1 — p 



s{£, k). 



Comparing this with (fT7 j) . (p|) . (p|) we see that s(£, k) = S(£,k,X\9) for A = and 9 = 

So they are indeed degenerated Stirling numbers. Applying the formula tflty to c(£, k) = 
s(£, £ — k) (1 — p) k we obtain 



c(£,k, P ) = (p-i) k J2^—Y ) 



r=l 



(£-k-r) 



n 

3=1 



PJ 
p — 1 



— r 



This shows (|15jl . For p = 2 one can obtain a much easier formula. It is however easier to derive 
this formula not from (J15|) but rather by direct verification of the recurrence relation. □ 

Remark 5.5. We have the following special cases: 



c(£,l,p) 



c(£,2,p) 



l)(l-2)(3l + 4p-5) 
24 



c{£,£-l,p) = (l+p)(l + 2p)---(l + (£-2)p). 
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